Skip to content

feat: GroundingTarget + GroundingCandidate data model for cascade architecture#256

Merged
abrichr merged 1 commit intomainfrom
feat/grounding-target-data-model
Mar 31, 2026
Merged

feat: GroundingTarget + GroundingCandidate data model for cascade architecture#256
abrichr merged 1 commit intomainfrom
feat/grounding-target-data-model

Conversation

@abrichr
Copy link
Copy Markdown
Member

@abrichr abrichr commented Mar 31, 2026

Summary

Phase 3 of the grounding cascade design — the foundation data model.

  • grounding.py: GroundingTarget (rich target per click step) + GroundingCandidate (normalized tier output)
  • demo_library.py: DemoStep.grounding_target field with serialization/deserialization
  • Fix demo description: "Clear data" → "Clear now" (matched actual button text)
  • Design docs: v1, v2, v3 cascade architecture

Every downstream phase (state narrowing, text anchoring, CLIP retrieval, verification) depends on this data model.

🤖 Generated with Claude Code

…hitecture

Phase 3 of the grounding cascade design (v3):

- grounding.py: GroundingTarget (rich target per click step — description,
  crop, nearby text, window title, structured transition expectations)
  and GroundingCandidate (normalized output from each grounding tier)
- demo_library.py: DemoStep gains optional grounding_target field,
  serialization/deserialization handles GroundingTarget objects
- Fix demo description: "Clear data" → "Clear now" (actual button text)
- Design docs: v1, v2, v3 cascade architecture

GroundingTarget is the foundation for the entire cascade. Every
downstream tier (OCR, CLIP, UI-Venus, GPT-5.4) operates on the same
rich signal instead of a weak description string.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@abrichr abrichr merged commit e912b65 into main Mar 31, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant